Code Smell Detection Using Ensemble Machine Learning Algorithms

نویسندگان

چکیده

Code smells are the result of not following software engineering principles during development, especially in design and coding phase. It leads to low maintainability. To evaluate quality its maintainability, code smell detection can be helpful. Many machine learning algorithms being used detect smells. In this study, we applied five ensemble two deep Four datasets were analyzed: Data class, God Feature-envy, Long-method datasets. previous works, stacking dataset results found acceptable, but there is scope improvement. A class balancing technique (SMOTE) was handle imbalance problem The Chi-square feature extraction select more relevant features each dataset. All obtained highest accuracy—100% for with different selected sets metrics, poorest accuracy, 91.45%, achieved by Max voting method Feature-envy twelve metrics.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fault Detection of Anti-friction Bearing using Ensemble Machine Learning Methods

Anti-Friction Bearing (AFB) is a very important machine component and its unscheduled failure leads to cause of malfunction in wide range of rotating machinery which results in unexpected downtime and economic loss. In this paper, ensemble machine learning techniques are demonstrated for the detection of different AFB faults. Initially, statistical features were extracted from temporal vibratio...

متن کامل

Using Machine Learning Algorithms for Automatic Cyber Bullying Detection in Arabic Social Media

Social media allows people interact to express their thoughts or feelings about different subjects. However, some of users may write offensive twits to other via social media which known as cyber bullying. Successful prevention depends on automatically detecting malicious messages. Automatic detection of bullying in the text of social media by analyzing the text "twits" via one of the machine l...

متن کامل

A Modified Stacking Ensemble Machine Learning Algorithm Using Genetic Algorithms

Distributed data mining and ensemble learning are two methods that aim to address the issue of data scaling, which is required to process the large amount of data collected these days. Distributed data mining looks at how data that is distributed can be effectively mined without having to collect the data at one central location. Ensemble learning techniques aim to create a meta-classifier by c...

متن کامل

Impulse Noise Detection in OFDM Communication System Using Machine Learning Ensemble Algorithms

An impulse noise detection scheme employing machine learning (ML) algorithm in Orthogonal Frequency Division Multiplexing (OFDM) is investigated. Four powerful ML's multi-classifiers (ensemble) algorithms (Boosting (Bos), Bagging (Bag), Stacking (Stack) and Random Forest (RF)) were used at the receiver side of the OFDM system to detect if the received noisy signal contained impulse noise or not...

متن کامل

Machine learning algorithms in air quality modeling

Modern studies in the field of environment science and engineering show that deterministic models struggle to capture the relationship between the concentration of atmospheric pollutants and their emission sources. The recent advances in statistical modeling based on machine learning approaches have emerged as solution to tackle these issues. It is a fact that, input variable type largely affec...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied sciences

سال: 2022

ISSN: ['2076-3417']

DOI: https://doi.org/10.3390/app122010321